Search CORE

21 research outputs found

Who wants to join me? Companion recommendation in location based social networks

Author: Cheng Z.
Eisenstein J.
Feng S.-S.
Gao H.
Heinrich G.
Kahanda I.
Lim K. H.
Qiao Z.
Romero D. M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/09/2016
Field of study

We consider the problem of identifying possible companions for a user who is planning to visit a given venue. Specifically, we study the task of predicting which of the user's current friends, in a location based social network (LBSN), are most likely to be interested in joining the visit. An important underlying assumption of our model is that friendship relations can be clustered based on the kinds of interests that are shared by the friends. To identify these friendship types, we use a latent topic model, which moreover takes into account the geographic proximity of the user to the location of the proposed venue. To the best of our knowledge, our model is the first that addresses the task of recommending companions for a proposed activity. While a number of existing topic models can be adapted to make such predictions, we experimentally show that such methods are significantly outperformed by our model

Crossref

Online Research @ Cardiff

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Author: Alborzi S. Z.
Altenhoff A.
Amezola M.
Antczak M.
Aridhi S.
Asgari E.
Atalay V.
Babbitt P. C.
Barot M.
Ben-Hur A.
Benso A.
Bergquist T. R.
Berselli M.
Bhat P.
Bjorne J.
Black G. S.
Boecker F.
Bonneau R.
Borukhov I.
Bosco G.
Boudellioua I.
Brackenridge D. A.
Brenner S. E.
Cao R.
Carraro M.
Casadio R.
Cetin Atalay R.
Chandler C.
Chang J. -M.
Cheng J.
Chi P. -H.
Cozzetto D.
Crocker A. W.
Dai S.
Dalklran A.
Das S.
Davidovic R. S.
Davis L.
Dayton J. B.
Dessimoz C.
Devignes M. -D.
Di Carlo S.
Dogan T.
Dzeroski S.
Fa R.
Fabris F.
Falda M.
Fang H.
Fernandez J. M.
Fontana P.
Frank Y.
Frasca M.
Freddolino P. L.
Freitas A. A.
Friedberg I.
Gemovic B.
Georghiou G.
Ginter F.
Gligorijevic V.
Goldberg T.
Gough J.
Greene C. S.
Grossi G.
Hakala K.
Hamid M. N.
Hoehndorf R.
Hogan D. A.
Holm L.
Hou J.
Hurto R. L.
Jain A.
Jeffery C. J.
Jiang Y.
Jo D.
Johnson D.
Jones D. T.
Kacsoh B. Z.
Kaewphan S.
Kahanda I.
Kihara D.
Koo D. C. E.
Kulmanov M.
Larsen D. J.
Lavezzo E.
Lee A. J.
Lees J. G.
Lewis K. A.
Liao W. -H.
Lichtarge O.
Linial M.
Liu Y. -W.
Mao Q.
Martelli P. L.
Martin M. J.
McGuffin L. J.
McHardy A. C.
Medlar A. J.
Mehryary F.
Mesiti M.
Moen H.
Mofrad M. R. K.
Mooney S. D.
Nguyen H. N.
Notaro M.
Novikov I.
O'Donovan C.
Omdahl A. R.
Orengo C. A.
Paccanaro A.
Pascarelli S.
Perovic V. R.
Petrini A.
Piovesan D.
Politano G.
Profiti G.
Radivojac P.
Re M.
Reeb J.
Renaux A.
Rifaioglu A. S.
Ritchie D. W.
Roche D. B.
Rodriguez J. M.
Romero A. E.
Rose P. W.
Rost B.
Saidi R.
Salakoski T.
Savojardo C.
Schoof H.
Sillitoe I.
Smuc T.
Suh E.
Sumonja N.
Supek F.
Thurlby N.
Tian W.
Tolvanen M. E. E.
Toppo S.
Toronen P.
Torres M.
Tosatto S. C. E.
Tress M. L.
Tseng W. -C.
Ur Rehman H.
Valentini G.
Veljkovic N.
Vidulin V.
Vucetic S.
Wan C.
Wang Z.
Warwick Vesztrocy A.
Wass M. N.
Wilkins A.
Yang H.
Yao S.
You R.
Yunes J. M.
Zhang C.
Zhang F.
Zhang S.
Zhang Y.
Zhang Z.
Zhao C.
Zhou N.
Zhu S.
Zosa E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole genome mutation screening in Candida albicans and aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

The Interaction Network Ontology-supported modeling and mining of complex interactions represented with multiple keywords in biomedical literature

Author: A Airola
A Doms
A Ozgur
A Petersohn
Arzucan Özgür
AX Chang
B Zhang
C Bettembourg
C Jonquet
CN Arighi
D Tikk
E Beisswanger
G Erkan
H Antelmann
H Ichikawa
I Bagyan
I Kahanda
I Karadeniz
J Hur
J Hur
J Hur
J Hur
J Park
JD Kim
JD Kim
JM Temkin
Junguk Hur
K Drzewiecki
K Fundel
L Tanabe
M Ashburner
M Dai
M Jiang
M Krallinger
M Marneffe
N Daraselia
NH Shah
P Grenon
R Isserlin
R Jelier
R Schroeter
RG Webster
T Joachims
Yongqun He
Z Xiang
Z Xiang
Z Xiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Author: Almeida-e-Silva DC
Altenhoff A
Babbitt PC
Bankapur AR
Bargsten JW
Ben-Hur A
Benso A
Bhat P
Bonneau R
Brenner SE
Bryson K
Cao RZ
Casadio R
Cejuela JM
Chapman S
Chen CT
Cheng JL
Cibrian-Uhalte E
Clark WT
Cozzetto D
D'Andrea D
Das S
Dawson NL
del Pozo A
Denny P
Dessimoz C
Di Carlo S
Dogan T
Dukka BKC
ElShal S
Falda M
Fang H
Feng S
Fernandez JM
Ferrari C
Fontana P
Foulger RE
Friedberg I
Funk CS
Gabaldon T
Gemovic B
Gillis J
Ginter F
Giollo M
Glisic S
Goldberg T
Gong QT
Gough J
Greene CS
Hakala K
Hamp T
Hieta R
Holm L
Hsu WL
Huntley RP
Jiang YX
Jones DT
Kaewphan S
Kahanda I
Kansakar L
Khan IK
Kihara D
Koo DCE
Koskinen P
Lavezzo E
Lee D
Lees JG
Legge D
Lepore R
Li B
Lin A
Linial M
Lovering RC
Magrane M
Maietta P
Marcet-Houben M
Martelli PL
Martin MJ
Mehryary F
Melidoni AN
Mesiti M
Minneci F
Mooney SD
Moreau Y
Mutowo-Meullenet P
Nepusz T
Ning W
O'Donovan C
Oates M
Ofer D
Orengo CA
Oron TR
Paccanaro A
Pavlidis P
Penfold-Brown D
Perovic V
Pichler K
Piovesan D
Politano G
Profiti G
Radivojac P
Rappoport N
Re M
Rehman HU
Richter L
Robinson PN
Romero AE
Rost B
Sahraeian SME
Salakoski T
Salamov A
Sasidharan R
Savino A
Sedeno-Cortes AE
Sharan M
Shasha D
Shypitsyna A
Sillitoe I
Skunca N
Smithers B
Stern A
Sternberg MJE
Supek F
Tian WD
Toppo S
Toronen P
Tosatto SCE
Tramontano A
Tranchevent LC
Tress ML
Valencia A
Valentini G
van Dijk ADJ
Veljkovic N
Veljkovic V
Vencio RZN
Verspoor KM
Vogel J
Vucetic S
Wang Z
Wass MN
Yang HX
Youngs N
Zakeri P
Zhang S
Zhong Z
Zhou YP
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2022
Field of study

Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

UTUPub

PHENOstruct: Prediction of human phenotype ontology terms using heterogeneous data sources.

Author: Ben-Hur A
Funk C
Kahanda I
Verspoor K
Publication venue: 'F1000 Research Ltd'
Publication date: 01/01/2015
Field of study

The human phenotype ontology (HPO) was recently developed as a standardized vocabulary for describing the phenotype abnormalities associated with human diseases. At present, only a small fraction of human protein coding genes have HPO annotations. But, researchers believe that a large portion of currently unannotated genes are related to disease phenotypes. Therefore, it is important to predict gene-HPO term associations using accurate computational methods. In this work we demonstrate the performance advantage of the structured SVM approach which was shown to be highly effective for Gene Ontology term prediction in comparison to several baseline methods. Furthermore, we highlight a collection of informative data sources suitable for the problem of predicting gene-HPO associations, including large scale literature mining data

Directory of Open Access Journals

University of Melbourne Institutional Repository

A close look at protein function prediction evaluation protocols

Author: Ben-Hur A
Funk CS
Kahanda I
Ullah F
Verspoor KM
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/09/2015
Field of study

BACKGROUND: The recently held Critical Assessment of Function Annotation challenge (CAFA2) required its participants to submit predictions for a large number of target proteins regardless of whether they have previous annotations or not. This is in contrast to the original CAFA challenge in which participants were asked to submit predictions for proteins with no existing annotations. The CAFA2 task is more realistic, in that it more closely mimics the accumulation of annotations over time. In this study we compare these tasks in terms of their difficulty, and determine whether cross-validation provides a good estimate of performance. RESULTS: The CAFA2 task is a combination of two subtasks: making predictions on annotated proteins and making predictions on previously unannotated proteins. In this study we analyze the performance of several function prediction methods in these two scenarios. Our results show that several methods (structured support vector machine, binary support vector machines and guilt-by-association methods) do not usually achieve the same level of accuracy on these two tasks as that achieved by cross-validation, and that predicting novel annotations for previously annotated proteins is a harder problem than predicting annotations for uncharacterized proteins. We also find that different methods have different performance characteristics in these tasks, and that cross-validation is not adequate at estimating performance and ranking methods. CONCLUSIONS: These results have implications for the design of computational experiments in the area of automated function prediction and can provide useful insight for the understanding and design of future CAFA competitions

Springer - Publisher Connector

University of Melbourne Institutional Repository

Using strong triadic closure to characterize ties in social networks

Author: Huberman B. A.
Jones J. J.
Kahanda I.
Knuth D. E.
Newcomb T. M.
Vazirani V. V.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Feature Selection for Social Media Data

Author: Argyriou A.
Gao H.
He X.
Huan Liu
Jensen D.
Jiliang Tang
Kahanda I.
Liu J.
Nie F.
Roth V.
Taskar B.
Zhao Z.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Genetic dissection of natural variation in oilseed traits of camelina by whole-genome resequencing and QTL mapping

Author: Amirebrahimi M.
Barry K.
Chen C.
Grabowski P.P.
Hu X.
Kahanda I.
Kudrna D.
Lachowiec J.
Li H.
Lovell J.T.
Lu C.
Mamidi S.
Mumey B.
Schmutz J.
Publication venue: 'Wiley'
Publication date: 01/01/2021
Field of study

Camelina [Camelina sativa (L.) Crantz] is an oilseed crop in the Brassicaceae family that is currently being developed as a source of bioenergy and healthy fatty acids. To facilitate modern breeding efforts through marker-assisted selection and biotechnology, we evaluated genetic variation among a worldwide collection of 222 camelina accessions. We performed whole-genome resequencing to obtain single nucleotide polymorphism (SNP) markers and to analyze genomic diversity. We also conducted phenotypic field evaluations in two consecutive seasons for variations in key agronomic traits related to oilseed production such as seed size, oil content (OC), fatty acid composition, and flowering time. We determined the population structure of the camelina accessions using 161,301 SNPs. Further, we identified quantitative trait loci (QTL) and candidate genes controlling the above field-evaluated traits by genome-wide association studies (GWAS) complemented with linkage mapping using a recombinant inbred line (RIL) population. Characterization of the natural variation at the genome and phenotypic levels provides valuable resources to camelina genetic studies and crop improvement. The QTL and candidate genes should assist in breeding of advanced camelina varieties that can be integrated into the cropping systems for the production of high yield of oils of desired fatty acid composition. © 2021 The Authors. The Plant Genome published by Wiley Periodicals LLC on behalf of Crop Science Society of AmericaOpen access journalThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

Directory of Open Access Journals

The University of Arizona

eScholarship - University of California

Detection of stealthy malware activities with traffic causality and scalable triggering relation discovery

Author: Bilge L.
Chen X.
Choi H.-K.
Cui W.
Elkan C.
Gu G.
Gummadi R.
Kahanda I.
King S. T.
Kolbitsch C.
Lee W.
Li Z.
Livadas C.
Stefan D.
Xie P.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref